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Description 

FIELD OF THE INVENTION 

5 [0001 ] The present invention relates to a site-specific endonuclease which recognizes a specific nucleotide sequence, 
to a gene coding for the endonuclease. to a recombinant vector containing the gene, to a transformant containing the 
vector, and to a process for producing the endonuclease. 

BACKGROUND OF THE INVENTION 

10 

[0002] Endonuclease is a nuclease (nudeic add degrading enzyme) which hydrotyzes the phosphodiester bond of a 
polynucleotide chain. Endonuclease recognizes and binds to a specific nudeotide sequence along DNA molecules, 
whereby molecules within the recognition sequence is cut Endonudease is a requisite enzyme for today's advanced 
> gene engineering techniques for doning and analyzing genes. 

IS [0003] A srte-specif ic endonudease Endo.Scel (hereinafter, also refen-ed to as "Scel") from an eucaryotic microor- 
ganism (e.g., yeast) is known to be a heterodimer having subunits of 75 kDa and 50 kDa. The subunits of Seel as well 
as genes encoding the subunits have been cloned, and the nucleotide sequences thereof have been determined (for 
75 kDa subunit. see Morishima. N. et al.. J. Biol. Chem. 265. 15189-15197 (1990) and for 50 kDa subunit. see JP-B-7- 
77556). 

20 [0004] In order to widely utilize the above-described endonuclease for artificially modifying a biochemical agent, a 
gene or the like, the endonuclease needs to be mass-produced with a gene expression system. The endonudease 
does not function unless it recognizes a specific nudeotide sequence, I.e., the endonuclease needs to be specific to the 
nudeotide sequence to be recognized. 

[0005] The 50 kDa subunit of the above-descrit>ed endonuclease Seel is encoded by mitochondrial gerK)mes of yeast 
25 (Saccharomyces cerevisiae), A gene of a mitochondrial genome of yeast contains codons unique to mitochondria 
which are different from amino acid codons (universal codons) used in gene expression systems from organisms gen- 
erally used for mass expression of protein {E.coli, baculovirus. yeast etc.). If this gene of the mitochondrial genome is 
directly used, the protein expression system hardly produces a protein of an original amino acid sequence. For exam- 
ple, while TGA is a stop codon as a universal codon. it is a different codon coding for other amino acid (Trp) in mito- 
30 chondria. A gene may be normally expressed in mitochondria but expression of the same gene may not result in a 
complete protein in a general expression system such as E.co/i due to incomplete translation caused by the stop codon. 

SUMMARY OF THE INVENTION 

35 [0006] The present invention aims at providing a site-specific endonudease wtiich recognizes a specific nucleotide 
sequence, to a gene coding for the endonuclease, to a recombinant vector containing the gene, to a transformant con- 
taining the vector, and to a process for producing the endonudease. 

[0007] The present inventors have gone through intensive studies to solve the above-described problems. As a result, 
they succeeded in producing a modified endonuclease capable of recognizing and cleaving a specific nucleotide 
40 sequence by substituting, in a gene encoding an amino acid sequence of the smaller subunit of an endonudease from 
yeast, codons unique to mitochorKlria with universal codons, and in mass-expressing the endonudease, whereby the 
present invention was accomplished. 

[0008] Accordingly, the present Invention relates to an endonudease capable of recognizing the nucleotide sequence: 
GCCCAGACATATGCGTGAATGATACC or a fragment thereof that comprises the sequence information necessary to be 
45 spedfically recognised and/or cleaved by the endonuclease of the invention. When in the following reference is made 
to a fragment of the 26 nucleotide sequence mentioned above, it is to be understood that said fragment confers the abil- 
ity to be specifically recognised and/or cleaved by the endonudease of the invention. The person skilled in the art 
knows how to determine saki functions without further ado. for example, by applying methodology involving gel shift 
assays. 

50 [0009] Further, the present inventk>n relates to a recomtMnant protein of either (a) or (b): 

(a) a protein comprising the amino add sequence represented by SEQ ID NO:3; or 

(b) a protein having an endonudease activity for recognizing the nudeotide sequence: GCCCAGACATATCCCT- 
GAATGATACC or a fragment thereof, the proton comprising at least one deletion, substitution or addition of amino 

55 add in the amino add sequence represented by SEQ ID NO:3. 

[001 0] Moreover, the present Invention relates to a gene encoding the recombinant protein of either (a) or (b): 
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(a) a protein comprising the amino acid sequence represented by SEQ ID NO:3; or 

(b) a protein having an endonuclease activity for recognizing the nucleotide sequence: GCCCAGACATATCCCT- 
GAATGATACC or a fragment thereof, the protein comprising at least one deletion, substitution or addition of amino 
acid in the amino acid sequence represented by SEQ ID NO:3. 

[001 1 ] In addition, the present invention relates to a gene containing DNA of eittier (c) or (d): 

(c) DNA comprising the nucleotide sequence represented by SEQ ID NO:2: or 

(d) DNA encoding a protein having an endonuclease activity for recognizing the nucleotide sequence: GCCCAGA- 
10 CATATCCCTGAATGATACC or a fragment thereof, the DNA being, capable of hybridizing with DNA which com- 
prises the nucleotide sequence represented by SEQ ID NO:2 under stringent conditions. 

[0012] Furthermore, the present invention relates to a recombinant vector comprising the above-described gene. 
[0013] Additionally, tiie present invention relates to a transformant comprising the above-desaibed recombinant vec- 
15 tor. 

[0014] Moreover, the present invention relates to a process for producing the endonuclease, comprising tiie steps of: 
culturing the above-described transformant: and recovering from the culture an endonuclease capable of recognizing 
the nucleotide sequence: GCCCAGACATATCCCTGAATGATACC or a fragment thereof. 

[0015] The present Invention also relates to an endonuclease produced by the process of the present invention. 

20 [0016] Finally, the present invention relates to a kit comprising the endonuclease and/or the recombinant protein 
and/or the gene and/or the recombinant vector and/or the transformant of the present invention. 
[0017] The components of the kit of the invention may be packaged in containers such as vials, optionally in buffers 
and/or solutions. If appropriate, one or more of said components may be packaged in one and the same container 
[0018] this specification includes part or ail of tiie contents as disclosed in the specification and/or drawings of Jap- 

25 anese Patent Application No. 10-141 861 which Is a priority document of the present invention: 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0019] ' 

30 

Fig. 1 shows amino acki sequences of an endonuclease before and after the modification; 

Fig. 2 shows the steps for constructing plasmid pY673L; 

35 Fig. 3 shows the steps for constructing plasmids pEN1.7 and pENO.5; 

Fig. 4 shows the nucleotide sequence of 50 kDa subunit gene of Seel wWch has been modified to conform tiie uni- 
versal code; 

40 Figs. 5A and 5B are photographs of electrophoresis showing sequence-specific endonuclease activities of the 50 
kDa subunit of the modified Seel; 

Figs. 6A and 68 show substitution site of the 50 kDa subunit from Saccharomyces uvarum and oligonucleotides 
used for the substitution; and 
45 ' ■ . . 

Figs. 7A and 78 are photographs of electrophoresis showing sequence-specific endonuclease activities of the 50 
kDa subunit from Sacc/7a/'o/77yces u\/aru/77. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

so 

[0020] Hereinafter, tiie present invention will be described in more detail. 

[0021 ] The present invention aims at mass-expressing mitochondrial genome DNA encoding the smaller subunit (50 
kDa) of an endonuclease from yeast by using a protein expression system such as E.coli or yeast In accomplishing 
this aim, the present invention modifies, in a gene coding for an amino acid sequence of the snfialler subunit, codons 
55 unique to mitochondria into universal codons. The present invention relates to such a niodified smaller subunit capable 
of recognizing arxi cleaving 26 base pairs of the specific nucleotide sequence. ... 
[0022] An endonuclease of the present invention (i.e., the 50 kDa subunit of an endonuclease from yeast; hereinafter 
also referred to as "Endo-Scel 50 kDa7 is prepared as follows. ^ 
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(1) Designing mutated anuno add and introducing mutation 

[0023] According to the present invention, the smaller subunit of endonudease Seel from Saccharomyces cerevisiae 
or the smaller subunit of endonudease Suv\ from Saccharomyces uvarum is used as a target for introdudng a ntuta- 
5 tion. The smaller subunits of t>oth Seel and Suv\ have molecular weights of 50 kDa. However, they differ from each 
other for having 2 different amino acids (Fig. 6A). 

[0024] The gene coding for the subunit (50 kDa) of Seel (hereinafter, refen-ed to as ''ENS2") is encoded b/ a mito- 
chondrial genome, and thus contains genetic codes unique to mitochondria (Table 1). 



Table 1 



20 



Difference between mitochondriat code and universal code 




Amino add to be translated 


Codon 


Universal code 


Mitochondrial code 


TGA 


STOP 


Trp 


ATA 


lie 


Met 


CTA or CTT 


Leu 


Thr 



[0025] The nucleotide sequence of ENS2 is known (JP-B-7-77556; Nakagawa, K.. Morishima. N., and Shibata. T. J. 
Biol. Chem. 266, 1977-1984 (1991)). When ENS2 is expressed in a general expression system such as E,coli accord- 
ing to the universal code, the translation is interrupted at TGA where it is read as a stop codon as can be appreciated 

25 from Table 1 (for exanrple. ENS2 described in JP-B-7-77556 indudes a stop codon TGA at nudeotides 97-99). While 
ATA is read as He according to the universal code, it is read as Met according to the mitochorxirial code. 
[0026] In order to construct a normal mass-expression system for ENS2, it is necessary to modify the genetic code 
of ENS2 such that the amino acid sequence obtained upon expression in a general expression system (e.g.. E.coli) is 
identical to an amino acid sequence as expressed in a mitochondrial expression system. Thus, according to the present 

30 invention, a codon for Trp (TGA) according to the mitochondrial code (Table 1) is substituted for a codon (TGG) that will 
be translated into Trp according to the universal code. Such substitution is also applied to ATA, and CTA and CTT that 
are translated irtto lie and Leu, respectively, according to the universal code (Tat>le 1). There is no need of si^stituting 
other degenerating codons which code for He or Leu according to the universal code. 

[0027] Basically, there are 37 amino acids as the candidates for modification within the amino add sequence of the 
35 smaller subunit of Seel (476 amino adds). Their positions are shown in Rg. 1 and Table 2. As to the endonudease from 
Saccharomyces uvarum (Sa vl), Gly at 21 7 and Asn at 346 (Fig. 1) are additionally substituted for Lys and Asp, respec- 
tively, so that a total of 39 amino acids are modified. It is not necessary to substitute all of the above 37 or 39 amino 
adds. The number of substitution may be 36. or 35 or less. Translations into codes unique to mitochondria may not be 
complete as long as the 26 nucleotides (SEQ ID NO:1 ) mentioned later are recognized by the endonudease. The posi- 
40 tions of substitution are summarized in Table 2 below. 



Table 2 



Position of substitution 


Amino add to k>e trans- 
lated before modification 


Amino acid to be trans- 
lated after modification 


33. 54. 247. 320. 433 


STOP 


Trp 


35. 40, 45, 48. 65. 80. 86. 92, 107. 109. 111. 123, 154. 163. 
168. 171. 177. 248. 313. 335. 347. 399. 465 


lie 


Met 


49. 99. 130. 135. 222. 267. 276. 395. 426 


Leu 


Thr 



[0028] Substitution of the amino adds is conducted by substituting the nucleotide sequence of the gene encoding the 
amno adds for another nucleotide sequence (site-directed mutagenesis). Examples of mutagenesis indude but not 
55 limited to the srte-directed mutagenesis method by T Kunkel (Kunkel. TA., Proc. Natl. Acad. Sci. U.S.A. 82. 488-492 
(1985)) and the Gapped duplex method. There is also a modified version of Kunkel method in which a maximum of 16 
oligonudeotides for modification are simultaneously used finstead 6f using 1 or 2 oligonucleotides as the usual Kunkel 
method) to efficiently substitute a plurality of sites. According to the present invention, mutation can be introduced by 
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using a mutation introduction kit (for example. Mutant-K (Takara Shuzo. Co.. Ltd.) or Mutant-G (Takara Shuzo, Co.. 
Ltd.)) ttiat utilizes site-directed mutagenesis, or by using LA PGR in vitro Mutagenesis series kit (Takara Shuzo, Co., 
Ltd.). 

[0029] The oligonucleotides are designed and synthesized using the nucleotide sequence of ENS2 as a template 
5 (1431 base pairs: Nakagawa. K. et al.. J. Biol. Chem. 266. 1977-1984 (1991); JP-B-7-77556) such that at least one 
base that is to be introduced with the mutation is flanked by about 8 to 30 bases (each oligonucleotide having a total of 
about 18 to 60 bases). The oligonucleotides can be obtained through chemical synthesis using a usual synthesizer. 

(2) Preparation of endonuclease gene which has been introduced with mutation 

10 

[0030] Each of the oligonucleotides obtained as described In (1) above is phosphorylated at 5* end. synthesized using 
ENS2 as a tenplate and subjected to ligation reactions. These reactions can be performed using T4 Polynucleotide 
Kinase (Takara Shuzo. Co. Ltd.). T4 DNA polymerase (Takara Shuzo, Co.. Ltd.), T4 DNA ligase (Takara Shuzo. Co.. 
Ltd.) or the like. 

15 [0031] The nucleotide sequence of the thus-obtained DNA is determined. The determination of the nucleotide, 
sequence may be conducted according to a known method such as Maxam-Gilbert chemical modification method or a 
dideoxynucleotide chain termination method using Ml 3 phage. Generally, the sequence is determined by using an 
automatic nucleotide sequencer (e.g.. ALF (Pharmacia). 373A DNA sequencer (Perkin-Elmer). etc.). 
[0032] SEQ ID NOS:2 and 3 exemplify the nucleotide sequence of the gene of the present invention and the amino 

20 acid sequence of the endonuclease of the present invention, respectively. The endonuclease of the invention acquires 
the essential function of endonuclease Seel or Suvl. i.e.. the function of recognizing the consensus sequence "CAN- 
RYNNANNCYYGTTW" and a sequence similar thereto, by linking to the larger subunit of natural endonuclease. The 
endonuclease of the invention exerts the function of the smaller subunit of the natural endonuclease and can specif 1- 
cally recognize the 26 bases represented by "GCCCAGACATATCCCTGAATGATACC (SEQ ID NO:1) or a fragment 

25 thereof that comprises the sequence information necessary to be specifically recognized and/or cleaved by the endo- 
nuclease of the invention. 

[0033] The endonuclease of the present invention may include at least one mutation such as deletion, substitution, 
addition or the like of the amino acid as long as it can specifically recognize the above 26 bases (SEQ ID NO:1). 
[0034] For example, the amino acid sequence represented by SEQ ID NO:3 may include deletion of at least one. pref- 
30 erably 1 to 10. more preferably 1 to 5 amino acids; addition of at leiast 1 . preferably 1 to 10. more preferably 1 to 5 amino 
acids; or substitution of at least 1. preferably 1 to 10, more preferably 1 to 5 amino acids for another amino acids. The 
endonuclease of the invention does not have to include mutations of all of the above-described 37 or 39 amino adds as 
long as it can recognize the above 26 bases (SEQ ID NO:1). 

[0035] The phrase "can recognize" as used herein refers to the function off the endonudease of the invention to bind 
35 to a site of the 26 bases within the gene and to cleave the gene such that the 26 base pairs are separated into two frag- 
ments with staggered ends. * ' 
. [0036] DNA that can hybridize with the at>ove gene (SEQ ID NO:2) under stringent conditions may also be included 
in the gene of the present invention. The stringent conditions are, for example, a sodium concentration of 15 to 900 mM 
and a temperature of 37 to 70*C, preferably 68*'C. 

40 

(3) Preparation and transformation of recombinant vector 
(i) Preparation of recombinant vector 

45 [0037] A recombinant vector of the invention may be obtained by ligating (inserting) the gene of the invention to (into) 
a suitable vector. The vector for inserting the gene of the invention is not limited to a specific one as long as it is riepli- 
cable in a host cell. Examples of such vector include but not limited to plasmid DNA and phage DNA. 
[0038] ' The plasmid DNA Is, for exanple. plasmid from Eco// (e.g., pRSET, pTZ19R. pBR322. pBR325, pUC1 18. 
pUC119, etc.). plasmid from bacillus (e.g.. pUBIIO, pTP5. etc.). or plasmid from yeast (e.g.. YEp13. YEp24, YCpSO. 

50 etc.). The phage DNA is. for example, X phage or the like. Similarly, an animal virus vector such as a retrovirus or vac- 
cinia virus vector; 6r an insect virus vector such as a t)aculovirus vector may also be used. 

[0039] In order to insert tine gene of the invention into the vector, tiie purified DNA is cleaved with a suitable restriction 
enzyme. Then, the cleaved fragment is inserted into the restriction site or a multicloning site of the suitable vector DNA. 
[0040] Thegeneof the present invention should be integrated into the vector such that tfie gene is able to function. If 
55 desired, the vector of the invention may include, other than the gene of the invention and the promoter, for example, a 
c/s-element (e.g., an enhancer), a splicing signal, a PQly(A) tail signal, a selective marker, and a ribosome binding 
sequence (SD sequence). Examples of the selective marker include dihydrofolate reductase gene, anrpicillin-resistant 
gene and neomycin-resistaint gene. 
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(ii) Preparation of transfbrmant 

[0041] A transformant of the invention may be obtained by introducing the recombinant vector of the irwention into a 
host cell in such a manner that the gene of interest is capaljle to be expressed. The host cell is not limited to a specific 
5 one as long as it can express the DNA of the present invention. Bacteria such as genus Escherichia (e.g., Escherichia 
coll), genus Bacillus (e.g.. Badllus subtitis), genus Pseudomonas (e.g.. Pseudomonas putida), genus Rhizobium {e,g.. 
Rhizobium meliioti), yeast such as Schlzosaccharomyces pombe, animal cells (e.g.. COS and CHO cells), and insect 
cells (e.g.. Sf9 and Sf21) are exemplified. 

[0042] When a bacterium such as E.coli is used as the host, it is preferable that the recombinant vector of the present 
10 invention is capable of autonomous replication and includes a promoter, a ribosome binding sequence, the gene of the 
invention and a transcription termination sequence. The recombinant vector may also include a gene for controlling the 
promoter. 

[0043] As the E.coii, E. coli K12 and DHl are exemplified and as Isacillus. Baciiius subtHis MM 14 and 207-21 are 
exemplified. 

IS [0044] As the promoter, any promoter may be used as long as it can be expressed in the host cell like E.co//. For- 
example. a promoter derived from E.coH or a phage, e.g.. trp promoter, lac promoter, pl promoter or pr promoter, may 
l>e used. Artificially designed and modified promoter like tac promoter may also be used. 

[0045] The recombinant vector may be introduced into the bacterium according to any method for introducing DNA 
into a bacterium. For example, caldum ion method (Cohen, S.N. et al.. Proc. Natl. Acad. Sci., USA. 69: 2110-2114 

20 (1972)) and an el ectroporation method may be employed. 

[0046] An yeast such as Saccharomyces cerevisiae, Saccharomyces uvarum, Schizosaccharomyces pombe or 
Pichia pastoris may also be used as the host In this case, the promoter may be any promoter that can be expressed 
in the yeasL Exanples of such promoter Include but not limited to gall promoter. gallO promoter, heat shock protein 
promoter, MF 1 promoter. PH05 promoter. PGKpromoter. GAP promoter. ADH promoter and AOX1 promoter. 

25 [00471 The recombinant vector may be introduced into the yeast by any method for introducing DNA into an yeast. 
For example, electroporation method (Becker. D.M. et al.. Methods Enzymol.. 194. 182-187 (1990)). spheropiast 
method (Hinnen. A. et al.. Proc. Natl. Acad. Sci.. USA. 75. 1929-1933 (1978)). or lithium acetate method (Itoh. H.. J. 
BacterioL. 153, 163-168 (1983)) may be employed. 

[0048] An animal cell such as simian cell COS-7, Vero. Chinese hamster ovary cell (CHO cell), mouse L cell, rat GH3 
30 or human FL cell may also be used as the host. As a promoter, for example. SR promoter. SV40 promoter, LTR pro- 
moter or CMV promoter may be used. Other than these promoters, for example, an early gene promoter of human 
cytomegalovirus may also be used. 

[0049] The recombinant vector may be introduced into the animal cell, for example, by an electroporation method, a 
calcium phosphate method or a lipofection method. 
35 [0050] An insect cell such as Sf9 cell, Sf21 cell or the like may also be used as the host. The recombinant vector may 
be introduced into the insect cell, for example, by a caldum phosphate method, a Ipofection method or an electropora- 
tion method. 

(5) Production of endonudease 

40 

[0051] The endonudease of the present invention may be obtained by culturing the afc»ove-d escribed transformant, 
and recovering the endonudease from the culture thereof. The term "culture" as used herein refers to a culture super- 
natant, a cultured cell or microbial cell, or a cell or microbial cell debris. 

[0052] The transfbrmant of the invention is cultured according to a general method used for culturing the host. 
45 [0053] A medium for culturing the transformant obtained from a microorganism host such as E.coli or yeast may be 

either a natural or a synthetic medium as long as it contains cartron sources, nitrogen sources, inorganic salts and the 

like assimilatDle by the microorganism, and as long as it can efficiently culture the transformant. 

[0054] As carton sources, cartxjhydrate such as glucose, fructose, sucrose, starch; organic acids such as acetic add, 

propionic add; and alcohols such as ethanol and propanol may be used. 
50 [0055] As niti'ogen sources, ammonia; ammonium salts of inorganic or organic adds such as ammonium chloride. 

ammonium sulfate, ammonium acetate, ammonium phosphate; other nitrogen-containing compounds; Peptone; meat 

extract; corn steep liquor and the like may be used. 

[0058] As inorganic substances, potassium dihydrogen phosphate, dipotassium hydrogen phosphate, magnesium 
phosphate, magnesium sulfate, sodium chloride, iron(ll) sulfate, manganese sulfote. copper sulfate, calcium cartsonate 
55 and the like may be used. 

[0057] The cultivation is generally performed under aerobic conditions such as shaking or aeration agitating condi- 
tions at 37°C for 12 to 18 hours. During the cultivation. pH is maintained at 6.5 to 7.5, preferably 7.0. pH is regulated 
witii an inorganic or organic acid, an alkafi solution or the like. 
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[0058] During the cultivation, an antibiotic such as ampiclilin, tetracycline or the like may be added to the medium if 
necessary. 

[0059] When culturing a microorganism transformed with an expression vector using an inducible promoter, an 
inducer may be added to the medium at need. For example, when a microorganism transformed with an expression 
5 vector using Lac promoter or trp promoter is cultured. Isopropyl 1 -thio-p-D-galactoside (IPTG) or indoleacetic acid (lAA) 
may be added to the medium, respectively. 

[0060] A transfbrmant obtained by using an animal cell host may be cultured in a generally used medium such as 
RPMI1640 medium or DMEM medium, or a medium obtained by supplementing the generally used medium with fetal 
bovine serum ard the like. 

10 [0061 ] The cultivation is generally conducted under 5% CO2 at 37°C for 1 to 3 days. During the cultivation, an antibi- 
otic such as kanamycin, penicillin or the like may be added to the medium. 

[0062] After the cultivation, where a microbial cell or another cell intracelluraly produced endonudease of the inven- 
tion, the endonudease Is extracted by disrupting the microbial cell or the other cell. Where a microbial cell or another 
cell extracenularly produced endonudease of the invention, the culture solution Is directly used. Alternatively, the micro- 
15 bial cell or the other cell Is removed through centrifugation or the like before isolating and purifying the endonudease of 
the invention from the culture through a general biochemical method for protein isdation and purification such as 
ammonium sulfate prec'qsitation. gel chromatography, ion exchange chromatography, affinity chromatography, or a com- 
bination thereof. 

20 EXAMPLES 

[0063] Hereinafter, the present invention will be described in detail by way of examples which do not limit the technical 
scope of the present invention. 

25 Example 1 : Preparation of sinale-stranded template DNA encoding su bunit of Seel and contalnino deoxvuradi 

[0064] EndcScel 50 kOa subunit gene ENS2 (1 431 base pairs; Nakagawa. K.. Morishima, N.. and Shibata, T. J, Biol. 
Chem. 266. 1977-1984 (1991)) was modified simultaneously within two regions of the gene. i.e.. within the upstream 
moiety of 1.0 kilobase pair and the downstream moiety of 0.4 kilobase pair. For this purpose, EcoRI/EcoRI fragment 

30 (1671 base pairs) containing the full-length 50 kDa subunit gene (Nakagawa, K., Morishima, N,, and Shibata, T. J.. BioL 
Chem. 266. 1977-1984 (1991)), arid Pstl/EcoRI fragment (534 base pairs) containing the downstream moiety of the 50 
kDa subunit gene were separately doned into phagemids pUC1 18 (Takara Shuzo, Co.. Ltd.), and were named plasmids 
pEN1.7 and pENO.5. respectively (Fig. 3). These phagemids were introduced into E.coii strains CJ236 (Takara Shuzo, 
Co., Ltd.) for transformation. The transfbrmant E.coii strains were shake cultured at 37*C for 12 hours or longer so as 

35 to prepare preculture solutions. Twenty nl of each pre-culture solution was added to 2 ml of 2 x YT culture medium con- 
taining anpidllin (ICQ ng/ml) (SambrooK Fritsch, E. R. and Maniatis; T. Molecular Cloning: a laboratory Manual, 
Second Edition. Cold Spring Hart>or Laboratory Press, CokJ Spring Hartor. New York (1989)) and cultured at 37**C for 
1 hour. To each medium, helper phage M13K07 (2.0 x 10^^ piaque-forming unit (pfu): Takara Shuzo, Co.. Ltd.) was 
added to constitute 0.4% in volume of the medium, and the resultant was cultured at 37°C for 1 hour. Thereafter, kan- 

40 amydn (100 MQ/ml) was added and the resultant was cultured at 37**C for 14 hours. Phage partides released from 
E.coli Into the media during the cultivation were recovered. Specifically. 1 .5 ml of each culture solution was centrif uged 
(1 4.000 rpm, 5 min.) in a micro-centrifuge. 1 .2 ml of ttie supernatant was collected and centrif uged under the same con- 
ditions to completely remove the cell, thereby obtaining 1 .0 ml of the supernatant Subsequent procedure for preparing 
the DNA was conducted according to DNA purification of bacteriophage Ml 3 phage summarized in the experimental 

45 text of J. Sambrook et al..(Sambrook. J.. Fritsch, E, F, and Maniatis. T Molecular Cloning: a laboratory Manual, Second 
Edition, Cold Spring Harbor Laboratory Press. Cold Spring Harbor, Now Yort< (1 989)). 

Example 2: Synthesis of oliaonudeotides for Introdudno s he-directed mutation 

so {Q Synthesis of single strand 

[0065] 50 kDa subunit gene ENS2 is encoded by mitochondria genome, and thus indudes genetic code unique to 
mitochondria (Table 1). In order to subject ENS2 to a general mass-expression system, these unique codons must be 
replaced so as to correspond to the universal code. According to Example 2. the bases were substituted by using a 
55 modified method of T Kunkel (Kunkel, T. A., Proa Natl. Acad Sci. U.S. A: 82, 488-492 (1985)). Whereas the general 
Kunkel method uses only one or two digonucleotides for modification, the present method simultaneously used a max- 
imum of 16 oligonucleotides for efficient substitution at multiple sites. 

[0066] 33 oligonudeotides were designed which each contained the base to be substituted flanked by approximately 
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10 to IS bases (Table 3). 

Table 3 



10 



so 



30 



1. AAAAGACTGOATTATAGAA (SEQ ID NO : 6 ) 
(A) 

2. TGAATATATaTATAAATTT (SEQ ID NO : 7 ) 
(A) 

3. TATTAAATGGGATAATAAT (SEQ ID NO : o ) 

(A) 

4. TATTAGATATGTATTATAATG (SEQ ID NO: 9) 

(A) 

5. TACACCTATGTCTAATAAA (SEQ ID NO: 10) 
15 (A) 

6. AAAATATTATGGATTATAAA (SEQ ID NO: 11) 

(A) 

7 . TTTTATATTTTAAATAAAATGAAAATGGAAATGGATAATTATAATAATAATA 

(A) (A) (A) (SEQ ID NO: 12) 

8. AAAATATTATGAATAATTTAA (SEQ ID NO: 13) 
(A) 

9. ACTATCTAATATTGAAACTAATTTATCTAATAATTT (SEQ ID NO: 14) 

(CT) 

10. TTATTTAATGGATAAATAT (SEQ ID NO: 15) 

(A) 

25 11. ATAAATATATGAAATATTTAG (SEQ ID NO: 16) 

(A) 

12. ATAATTATATGTTTAATAATA (SEQ ID NO: 17) 

(A) 

13. CGAGGTATTACAATTACIAATCATGCTAATGAT (SEQ ID NO: 18) 

(CTA) 

14. TTTTAGTAGAAAAATGGATGGATACTTTAAAAGATA (SEQ ID NO: 19) 

^A) (A) 

15. AGCTAAAGAAAAGATTTTTACTAATATTTATAATAATTA (SEQ ID NO: 20) 

(CT) 

16. AAATATTATGGATATTAAA (SEQ ID NO: 21) 
35 (C) 

17. TAATTATTGGTTATCTGG (SEQ ID NO: 22) 

(A) 

18. ATCATCTATGTATAATCCT (SEQ ID NO: 23} 

(A) 

^ 19. TTAAAAATATGAGACCTAG (SEQ ID NO: 24) 

(A) 

20. GATGAATTAATGAAATTTATTTA (SEQ ID NO: 25) 

(A) 

21. ATTAAATTTAGATTTAATACITTTATTAAATCATATAAT (SEQ ID NO: 26} 

(CTA) 

^ 22. TATAATAAATATATTAATATGCATAATGCACGTAAACC (SEQ ID NO: 27) 

(A) 

23. TAAATTTTTAATAAATAATATGAiZITGTTTTATTAAATGgGA (SEQ ID NO : 2 8 ) 

(ACTA) 

24. AAGATTAATGAATTCAAAA (SEQ ID NO: 29) 
50 (A) 

25. GATTATAAATTATTATATACTTATTTTTATATTTTAAAT (SEQ ID NO: 30) 

(CT) 

26. gAATAATTTAAATTATAAAACXTCTAATATTGAAacTA (SEQ ID NO: 31) 

55 
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(CTA) 

27 TTCTCTATTAATATTAAAACTAATTTAGCTAAAGAAA 

(CT) 

2 8 . AAATTATTTACCAG AACTACTGATGAATTAATgAAATT 

(CT) 

2 9 , CATATAATTGGAAT AATAGA 

(A) 

3 0 AATTTTTAATOAATAATATg 

(A) 

3 1 , TTTAGATATGTTAAATATg 
(A) 

3 2 . ATATgTTAAATATGATTCCTAATAA 

(A) 

3 3 , CTGgATTATGGAATATGAAT 
(A) 



(SEQ 


ID 


NO: 


32) 


(SEQ 


ID 


NO: 


33) 


(SEQ 


ID 


NO: 


34) 


(SEQ 


lb 


NO: 


35) 


(SEQ 


ID 


NO: 


36) 


(SEQ 


ID 


NO: 


37). 


(SEQ 


ID 


NO: 


38) 



[0067] In Table 3, the base(s) in parentheses underneath each sequence represent the original base(s) that was 
(were) substituted for the underlined base(s). The bases showvn in small letters represent those which have already 
replaced the original oligonucleotide. 

[0068] The lengths of the oligonucleotides vary within the range of 18 to 52 bases and they include mutation of 1 to 
a maximum of 4 residues. These oligonucleotides were used for substituting 50 base pairs of the 1431 bp 50 kDa sub- 
unit gene to modify 37 codoris. 5* end of each oligonucleotide was phosphorylated so as to allow the DNA ligase reac- 
tion described later. The composition of the reaction mixture for the phosphorylation is showri below: 

100mMTris-HCI(pHa0) 

1 0 mM magnesium chloride 

7mMdithiothreitol 

1 mM ATP, 1 jiM oligonucleotide 

T4 polynucleotide kinase (15 units) 

Total amount 30 

[0069] The reaction mixture was subjected to phosphorylation reaction at 37*»C for15 min.. and then the enzyme was 
inactivated through a treatment at 70*'C for 10 min. 

(ii) Synthesis of complementary strand 

[0070] The oligonucleotides obtained in fi) were treated as follows to obtain doiible-strands. Compositions of the 
annealing txjffer and the elongation reaction buffer are shown below: 



Annealing buffer 
200 mM Tris-HCI (pH 8.0) 
100 mM magnesium chloride 
500 mM sodium chloride 
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(continued) 
Annealing buffer 
10 mM dithiothreitol 



Elongation reaction buffer 
50 mM Tris-HCI (pH 8.0) " 
5 mM dithiothreitol 
60 mM amnranium acetate 
0.5 mM each of dNTPs (A. C, T. G) 
5 mM magnesium chloride 
1 mM niootinamido adenine dnucleotide 



[0071] Distilled water was added to 1 jJ of the annealing buffer and 0.2 pmol of the single-stranded template DNA. 
resulting in a total amount of 10 ^il. One jd of the solution was dispensed to be mixed with 1 jil of the phosphorylated 
oligonucleotide solution. The resultant mixture was left to stand at SS^'C for 15 min. and then at 37'»C for 15 min.. 
wherdby the oligonucleotide annealed to the single-stranded DNA. To the solution, 25 y\ of the elongation buffer. 60 
units of E.coJi DNA ligase and 1 unrt of T4 DNA polymerase were added and left to stand at 25*»C for 2 hours so as to 
synthesize a conplementary strand. Three y\ of 0.2 M ethylene diamine tetra acetic acid tetrasodium salt (pH 8.0) was 
added to terminate the enzyme reaction, after which the enzyme was inactivated through treatment at 65°C for 5 nnin. 
This reaction solution was directly used for the sut:>sequent transformation. 



Example 3: Transformation 

[0072] £.coli BMH71-18 mutS (Takara Shuzo, Co.. Ud.) was used such that the nucleotide sequence of the wild-type 
DNA strand (the single-stranded DNA prepared with CJ236) in the double-stranded plasmid DNA obtained through the 
complementary strand synthesis was substituted for a mutant type. In this E.coli strain, deoxyuracil contained in the sin- 
gle-stranded DNA prepared with CJ236 was hydrolyzed by enzyme uracil-DNA glycosylase and then synthesized again 
using the DNA strand containing the substituted base as a template (Undahl. T, Ann. Rev, Biochem, 51 , 61-87 (1982). 
[0073] The whole reaction mixture with the synthesized complementary strand was added tp 100 mI solution contain- 
ing conpetent cell of BMH71 -18 mutS. E.coli competent cell was prepared according to the method of H. Inoue et al. 
(Inoue. H.. Nojima, H., and Okayama, H.. Gene 96. 23-28 (1990)). 

[0074] To the solution, a medium was added and left at 37«»C for 1 hour. Then. 30 of helper phage {supra) was 
added and left to stand at 37«C for another 30 min. for infection to take place. 40 ^il of culture solution of BMH71-18 
intracellularly containing both the helper phage and the plasmid was fractionated and added to 2 ml of 2 x YT medium 
containing airpidllin (100 jig/ml) and kanamydn (100 pQ/vnti. The resultant medium was shake cultured at 3r»C for 16 
to 20 hours to produce a phage. 

[0075] The microbial cell was removed through centrifugation (14.000 rpm. 5 min.). The supernatant was collected 
which contained the phage partide incorporating the single-stranded DNA of the plasmid with the substituted base. 
With 20 of the supernatant. 80 nl of strain MV1 1 84 (Takara Shuzo. Co.. Ltd.) which has been cultured for 12 hours or 
longer was mixed and left to stand at 37*»C for 1 0 min. so as to inject the single-stranded DNA of the phage into the cell. 
The strain MV1184 containing the plasmid resulting from replication of the Integrated single-stranded DNA was inocu- 
lated to an LB agar medium containing 100 ng/m\ ampidlUn (Sambrook. J., Fritsch, E. F.. and Maniatis. T. Molecular 
Cloning: a laboratory Manual. Second Edition. Cold Spring Harbor Laboratory Press. Cold Spring Harbor New York 
(1989)) for selection. The nudeotide sequences of the 50 kDa subunit genes were analyzed for some dones with an 
automatic sequencer ALF (Pharmacia) to confirm the incorporation of ttie predetermined substitutions. The fluorescent 
primer for the analysis of the nudeotide sequence was purchased from Pharmacia (Uppsala. Sweden). The DNA 
sequendng reaction was based on the Sanger method (Sanger. F. et al., Proc. Natl. Acad Sci, 74. 5463-5467 (1977)) 
according to the protocol of Pharmacia. 

[0076] For the starting material, plasmid pEN1.7, forty nudeotide substitutions were performed by 9 cydes of the 
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mutagenic process, while 10 nucleotide substitutions were performed for pENO.5 by 4 cycles of such process. 
[0077] Once all of the substitutions were confirmed, the upstream and downstream moieties were linked at the Pst\ 
cleavage site, thereby obtaining a gene of the invention encoding the SO kDa subunit with complete substitutions (Rg. 
4. SEQ ID NO:2). 

Example 4: Construction nf expression olasmid 
for 50 kPa subunH 

[0078] For facilitating the linking between the nxxJified 50 kDa subunit gene and the vector for inducing expression 
thereof, restriction sites were introduced into the 5' and 3* terminuses of the modified gene through polymerase chain 
reaction (PGR). The reaction was performed using Taq DNA polymerase (Takara Shuzo, Co.. Ltd.) according to the pro- 
tocol of the manufacturer. Sequences of the used primers are shown below. 

5'-CC GGATCC ATGAAAAAAC-3' (SEQ ID NO:4) . 
5'-G GGTCGACT TATTTAATGTATCC-3* (SEQ ID NO:5) 

[0079] The underlined parts are the newly introduced SamHI and Sa/I recognition sequences. The reaction was per- 
formed through 25 cycles of: 94»C for 1 min.; 45**C for 2 min.; and 72**C for 3 min. 

[0080] The DNA fragment (1447 base pairs) amplified by PCR was separated through agarose (0.8%) gel electro- 
phoresis (Sambrook. J,. Fritsch. E. F, and Maniatis, T. Molecular Cloning: a laboratory Manual, Second Edition. Cokl 
Spring Harbor Laboratory Press, Cold Spring Harbor. New York (1989)) and stained with ethWium bromide for confir- 
mation. The fragment was recovered from the agarose gel using Geneclean kit (BIO101 . California, USA). 
[0081 1 The recovered DNA fragment was treated with BamHI and Sail and subcloned into pRSET (Invitrogen Corp,) 
and pT219R (Pharmacia) to obtain pSC50 and pTZSCSO, respectively. Plasmid pSCSO was used for inducing the 
expression. Plasmid pTZSCSO was subjected to DNA sequencing using fluorescent primer (isupra) so as to confirm that 
no extra mutation had been introduced during the PCR. 

Example 5: Induction of expression o f 50 kDa subunit 

[0082] Expression plasmid pSC50 was introduced into a competent cell of E.coli BL21 (DE3) pLysS (Invitrogen 
Corp.). The transformant cell was pre-cultured thi-ough shake cultivation in an LB liquid medium containing ampidllin 
(150 iiQ/m\) and chloramphenicol (34 ^ig/ml) (Sambrook, J., Fritsch. E. F. and Maniatis. T. Molecular Cloning: a latxyra- 
tory Manual. Second Edition. Cold Spring Harbor Laboratory Press. C^old Spring HartkDr. New York (1989)) at 37»C 
overnight The pre-cultured solution (4% in volume of a resultant culture) was centrifuged (2,500 x g. 10 min.) to recover 
the microbial cell. This precipitate was suspended in a small anrourrt of fresh medium, which was then added to a liquid 
medium. Shake cultivation (37''C) was performed until tiie suspension level at 600 nanometers (nm) (OD600) of about 
0.5 was obtained. Thereafter, shake cultivation was continued at 18°C. When the OD600 became about 0.8. isopropyl 
1-thlo-p-D-galactoslde (IPTG) was added to tiie final concentration of 0.4 mM to initiate induction of expression of the 
50 kDa subunit After performing shake cultivation at 18»C for another 12 hours. Bcoliwas recovered through centrifu- 
gatioh, rapidly frozen with liquid nitrogen and stored at -80°C. 

Example 6: Purification of 50 kDa subunit from E.coli 

[0083] The microbial cell stored at -80°C was melted at room temperature. The subsequent treatments were con- 
ducted at 4"C or on ice. The microbial cell was suspended in Buffer A (20 mM Tris-HCI buffer (pH 8.0). 500 mM sodium 
chloride, 5 mM imidazole. 1 mM phenylmethylsulfonyl fluoride (PMSF; Sigma Aldrich Japan K.K.. Tokyo. Japan). 0.1 % 
NP-40 (Nacalai Tesque. Inc. Kyoto, Japan)). The resultant suspension was rapidly frozen with liquid nitrogen and 
melted under running water to disrupt tiie E.co//. The suspendon was treated for 5 times witii an ultirasonicator (UR- 
200P. Tomy Seiko Co.. Ltd.. Tokyo, Japan) at a maximum output for 30 sec. 

[0084] The treated solution was centi^ifuged (39,000 x g. 20 min.) at 4 "C. The obtained supernatant was f Htrated 
through 0.45 ym Mylex filter (Millipore. Massachusetts, USA). The sample was placed in a column (0 10 mm. 2.0 ml) 
loaded with Probond Nickel Chelate Resin (Invitrogen Corp.) which had been equibralized witti Buffer A. Then, tiie sam- 
ple was washed with 20 ml of Buffer A (ten times the volume of the resin). After another washing with 12 ml Buffer A 
containing 60 mM imidazole (six times the volume of the resin), the resultant was subjected to gradient elution witii 60- 
50 mM imidazole-containing Buffer A (total amount of 80 ml). The eluted fraction was subjected to sodium dodecyl sul- 
fatepdyacrylamide gel electrophoresis (Laefnmli. U. K. Nature, 227, 680-685 (1970)) and stained wHh Coomasie bril- 
liant blue to confirm the presence of the 50 kDa subunit The fractfon containing tiie 50 kDa subunit was dialyzed 
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against Buffer B (20 mM Tris-HQ buffer (pH 7.5). 300 mM scxfium chloride, 1 mM ethylenediaminetetraacetic acid tet- 
rasodium salt 1 mM dithiothreitol). The purified protein was quantified using Protein assay agent (Bio-Rad. California. 
USA) according to the micro-assay method of the manufacturer Bovine serum albumin solution (Sigma Aldrich Japan 
K.K.) was used as a standard protein. As a result, 300 jig of purified protein was obtained from 25 g (wet weight) of the 
5 microbial cell. 

Example 7: Measurement of endonucle ase activity 

[0085] A substrate for measuring an endonuclease activity of the 50 kDa subunit was prepared as follows. An 
10 EcoRyEcoHi fragment containing o//2 region on mitochondria DNA within which Endo.Scel is known to cleave (1671 
base pairs: Nakagawa. K. et al.. EMBO J. 11, 2707-2715 (1992)) was subcloned into phagemid pUC119 (Takara 
Shuzo. Co.. Ltd.). the resultant called pY673L (Rg. 2). Plasmid pBR322 (Takara Shuzo. Co.. Ltd.) was used as a control 
DNA substrate. Plasmids pY673L and pBR322 were used to transform E.co//. Then, the plasmids were extracted from 
£.co// and highly purified using Qiagen column (Qiagen Japan, Tokyo. Japan). 
15 [0086] The composition of the reaction solution used for measuring the endonudease activity of the 50 kDa subunit 
is shown below: 



20 50 mM Tris-HCI buffer (pH 8.0) 

50 mM sodium chloride 
10 mM magnesium chloride 
1 mM dithiothreitol 

25 

25 ng substrate DNA (pY673L or pBR322 which has been linearized with restriction enzyme Seal {Fig. 2)) 

0.4 to 60 ng 50 kDa subunit 

Total volume 30 pi 

30 

[0087] Afler performing the DNA cleavage reaction at 37°C for 30 min., ethylenediaminetetraacetic acid tetrasodium 
saft and sodium dodecyl sulfate were added to final concentrations of 10 mM and 0.3%, respectively, to terminate the 
reaction. The cleaved DNA was subjected to 0.8% agarose electrophoresis either directly or after concentrating the 
35 DNA through phenol extraction and ethanol precipitation. 

[0088] After the electrophoresis, the gel was stained with ethidium brontide (Sigma Aldrteh Japan K.K.) or SYBR 
Green (Takara Shuzo. Co.. Ltd.) to confirm cleavage of DNA. The DNA was detected using FMBIO Imaging device 
(Takara Shuzo, Co.. Ltd.) to determine the DNA cleavage. 

40 Example 8: Detection of s equence-specific endonudease 

[0089] The dimeric EndcScel recognizes and cleaves in vivo and in vitro 26 base pairs similar to the consensus 
sequence within the o//2 gene region on the mitochondria DNA (Nakagawa. K., Morishima, N., and Shibata. T. EMBO 
J. 11. 2707-2715 (1992)) (Fig. 2). The purified 50 kDa subunit of EndaScel cleaved a specific sequence (SEQ ID 

45 NO:1) by itself. ^ 

[0090] Specific cleavage of o//2 with the 50 kDa subunit using plasmid pY673L containing o//2 as the substrate was 
confirmed (Rg. 5A). Refemng to Fig. 5A. Unes 1 to 8 are the resurts obtained with 0.5. 1.0. 2.0, 4.0. 8.0. 16.0, 32.0 
and 64.0 ng of the 50 kDa subunits. respectively. With 64 ng of 50 kDa subunit 60% of pY673L (25 ng) in the reaction 
solution was sequence-specifically cleaved at 37'>C within 30 min.. whereby DNA fragments of 3.4 and 1.4 Wlobases 

so were detected. 

[0091] DNA was not cleaved with the 50 kDa subunit using plasmid pBR322 as the substrate and no cleavage frag- 
ment was detected (Fig. SB), Referring to Rg. SB. Lanes 1 to 7 are the results obtained with 2.3, 4.5. 9.0. 18.0. 36.0, 
72.0 and 144 ng of the 50 kDa subunits. respectively. When an excessive amount (200 ng) of the 50 kDa sUxjnit was 
used, no cleavage was found with pBR322 or other DNAs (mitochondria DNA (80 Wlobase pairs) from bud yeast strain, 
55 E CO// phage X DNA (47 kiiobase pairs), and bacillus phage (|>105 DNA (38 Wlobase pairs) which did not contain spedfte 
sequence (26 base pairs) within o//2 gene region). 
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Example 9: Mass production of 50 kPa subunit from fiaccharomvcBS uv arum and detection of the activity thereof 

[0092] Endo-Suvl 50 kDa subunit, a homologous protein of Endo. See I 50 kDa subunit from Saccharomyces 
cerevisiae is present in Saccharomyces uvarum (Nakagawa. K.. Morishima, N.. and Shibata. T. J. Bio. Chem. 266. 
1977-1984 (1991)). Both subunits have 476 amino acid residues but there are two differences in amino acid level 
between them. The amino add differences between Endo.Scel and Endo.Suvl 50 kDa subunits are shown in Fig. 6A. 
[0093] A mass-expression gene for Endo.Savl 50 kDa sutttjnit was prepared by introducing two additional modifica- 
tions into the modified gene for EndaScel 50 kDa subunit. The oligonucleotides used for the substitutions of the amino 
acids for Endo.SL/vl 50 kDa subunit are shown in Rg. 6B. With reference to Fig. 68, the bases in parentheses con-e- 
spond to the nucleotide sequence of Endo.Scel 50 kDa subunit. For this purpose, two oligonucleotides were newly syn- 
thesized to introduce mutations according to the gene modification method described above. The mutation was 
confirmed through DNA sequencing. The modified gene was subcloned into pRSET vector (Invitrogen Corp.). which 
was then introduced Into E.coli BL21 (DE3) pLys by transformation method. 

[0094] Endo.Suvl 50 kDa subunit was expressed and purified according to tiie method applied to Endo.Scel 50 kDa 
subunit described at)0ve. The purified Erxlo.Suvl 50 kDa subunit was used to specifically cleave plasmid pY673L 
[0095] As a result, Endo.Suvl 50 kDa subunit was equally effective in sequence-spedfically cleaving the Endo.Scel 
50 kDa sulxinit cleavage site on plasmid pY673L (Fig. 7A). 

[0096] ly/1eanwhile. Endo.St/vl 50 kDa subunit did not deave plasmid pBR322 at all (Fig. 7B). Also, other DNAs (bud 
yeast mitochondria DNA, bacillus phage ((> 105 X. DNA and E.coli phage X DNA which did not contain o//2 gene region) 
were not cleaved by Endo. Suvi 50 kDa subunit. 

[0097] In Figs. 7A and 7B. Lanes 1 to 7 are the results obtained with 2.3. 4.5, 9.0. 18.0. 36.0. 72.0 and 144 ng of 50 
kDa subunits. respectively. 

[0098] According to the present Invention, a site-spedf tc endonuclease capable of recognizing a specific nucleotide 
sequence, a gene encoding the endonuclease. a recombinant vector containing the gene, a transformant containing tiie 
vector, and a process for producing tiie endonuclease are provided. Since the endonuclease of the present invention is 
capable of recognizing a specific sequence of 26 bases, it is useful in ttie field of genetic engineering and biochemistry 
in modifying and mapping DNA for a wide application, i.e.. plasmid to genome. 

[0099] Ail publications, patents, and patent applications cited herein are incorporated herein by reference in their 
entirety. 

[01 00] The following are information on sequences described herein: 

SEQUENCE LISTING 



<110> The Institute of Physical and Chemical Research 



<120> Endonuclease 



<130> PH-651 



<I40> 
<141> 



<15p> JP98/141861 
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<151> 1998-05-22 
<160> 38 

<170> Patent In Ver. 2.0 

<210> 1 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 1 

gcccagacal atccctgaat gatacc 

<210> 2 
<2ll> 1431 
<212> DNA 

<213> Saccharorayces cerevisiae 

<220> 

<22l> CDS 

<222> (1)..(1428) 
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<400> 2 

atg aaa aaa caa aat tta aat tct att tta tta atg tat att aat tat 48 
Met Lys Lys Gin Asti Leu Asn Ser He Leu Leu Met Tyr He Asn Tyr 

1 5 10 15 

att att aat tat ttt aat aat att cat aaa aat caa tta aaa aaa gac 96 
lie lie Asn Tyr Phe Asn Asn He His Lys Asn Gin Leu Lys Lys Asp 

20 ■ 25 30 

tgg att atg gaa tat gaa tat atg tat aaa ttt tta atg aat aat atg 144 
Trp lie Met Glu Tyr Glu Tyr Met Tyr Lys Phe Leu Met Asn Asn Met 

35 40 45 ■ 

act tgl ttt att aaa tgg gat aat aat aaa att tta tta tta tta gat 192 
Thr Cys Phe lie Lys Trp Asp Asn Asn Lys He Leu Leu Leu Leu Asp 

50 55 60 

atg tat tat aat gta tta tat aac tat cat aaa caa cgt aca cct atg 240 
Met Tyr Tyr Asn Val Leu Tyr Asn Tyr His Lys Gin Arg Thr Pro Met 

65 70 ' ^5 . 

tct aat aaa aga tta atg aat tea aaa aat att atg gat tat aaa tta 288 
Ser Asn Lys Arg Leu Met Asn Ser Lys Asn He Met Asp Tyr Lys Leu 

85 90 95 

tta tat act tat tit tat att tta aat aaa atg aaa atg gaa atg gat 336 
Leu Tyr Thr Tyr Phe Tyr lie Leu Asn Lys Met Lys Met Glu Met Asp 

100 ■ 105 no 

aat. tat aat aat aat aat aat aat att tea tta aaa tat aat gaa tta 384 
Asn Tyr Asn Asn Asn Asn Asn Asn He Ser Leu Lys Tyr Asn Glu Leu 

115 120 125 

tta aaa aat att atg aat aat tta aat tat aaa act tct aat att gaa 432 
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w 



IS 



20 



25 



30 



35 



40 



45 



SO 



Leu Lys 
130 
act aat 
Thr Asn 
145 

aaa tat 
Lys Tyr 

atg ttt 
Met Phe 

tta gat 
Leu Asp 

gaa ggt 
Glu Gly 
210 
get aat 
Ala Asn 
225 

agt att 
Ser lie 

ttt gtt 
Phe Val 



Asn He Met Asn Asn Leu 
135 

tta tct aat aat ttt tat 
Leu Ser Asn Asn Phe Tyr 



atg aaa 
Met Lys 

aat aat 
Asn Asn 
180 
tta aat 
Leu Asn 
195 

gat ggt 

Asp Gly 

gat gta 
Asp Val 

tta gta 
Leu Val 

aat get 
Asn Ala 
260 



aat aat 
Asn Asn 
150 
tat tta 
Tyr Leu 
165 

att aat 
I le Asn 

aat aat 
Asn Asn 

tat att 
Tyr lie 

tta aat 
Leu Asn 
230 
gaa aaa 
Glu Lys 
245 

ttc tct 
Phe Ser 



gat atg 
Asp Met 

tat aaa 
Tyr Lys 

gaa ttt 
Glu Phe 
200 
ggt cct 
Gly Pro 
215 

act ate 
Thr lie 

tgg atg 
Trp Met 

att aat 
Me Asn 



Asn Tyr Lys Thr Ser 

140 • 
tta atg gat aaa tat 
Leu Met Asp Lys Tyr 
155 

tta aat atg att ect 
Leu Asn Met lie Pro 

170 
ggt aaa 
Cly Lys 
185 . 
tat gat 
Tyr Asp 



gga ggt 
Gly Gly 

ttt att 
Phe lie 

gat act 
Asp Thr 
250 
att aiaa 
lie Lys 
265 



tta aat att. 
Leu Asn lie 

tat tta tea 
Tyr Leu Ser 
205 

att aca att 
lie Thr He 

220 
aat aaa aga 
Asn Lys Arg 
235 

tta aaa gat 
Leu Lys Asp 

act aat tta 
Thr Asn Leu 



Asn lie Glu 

tta att. aat 
Leu I le Asn 
160 

aat aat tat 
Asn Asn Tyr 
175 

. aaa aca gta 
Lys Thr Val 
190 

ggg tta att 
Gly Leu lie 

act aat cat 
Thr Aisn His 

att aaa aat 
1 le Lys Asn 
240 

aat cct tat 
Asn Pro Tyr 

255 
get aaa gaa 
Ala Lys Glu 
270 



480 



528 



576 



624 



672 



720 



768 



816 



55 
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15 



1056 



aag att ttt act aat att tat aat aaa tta tat agt gat tat aaa att 864 
Lys He Phe Thr Asn He Tyr Asn Lys Leu Tyr Ser Asp Tyr Lys lie 

275 280 285 

aat caa att aat aat cat ate cct tat tat aat tat tta aaa att aat 912 
Asn Gin He Asn Asn His He Pro Tyr Tyr Asn Tyr Leu Lys He Asn 

290 295 300 

aat aaa tta cct att aaa aat att atg gat att aaa aat aat tat tgg 960 
Asn Lys Leu Pro He Lys Asn He Met Asp He Lys Asn Asn Tyr Trp 
305 310 315 320 

^ tta get ggt ttt aca get gca gat ggt tct ttt tta tea tct atg tat 1008 

Leu Ala Gly Phe Thr Ala Ala Asp Gly Ser Phe Leu Ser Ser Met Tyr 

* 325 330 335 

aat cct aaa gat aca tta tta ttt aaa aat atg aga cct agt tat gtt 
Asn Pro Lys Asp Thr Leu Leu Phe Lys Asn Met Arg Pro Ser Tyr Val 

340 345 350 

att tea caa gtt gaa aca cgt aaa gaa tta att tat tta att caa gaa 
He Ser Gin Val Glu Thr Arg Lys Glu Leii He Tyr Leu He Gin Glu 

355 360 365 

tct ttt gat tta tct att tct aat gtt aaa aaa gtt ggt aat aga aaa 
Ser Phe Asp Leu Ser He Ser Asn Val Lys Lys Val Gly Asn Arg Lys 

370 375 380 

tta aaa gat ttt aaa tta ttt ace aga act act gat gaa tta atg aaa 
Leu Lys Asp Phe Lys Leu Phe Thr Arg Thr Thr Asp Glu Leu Met Lys 
385 390 ' 395 400 

ttt att tat tat ttt gat aaa ttt tta cct tta cat gat aat aaa caa 
Phe He Tyr Tyr Phe Asp Lys Phe Leu Pro Leu His Asp Asn Lys Gin 



25 



30 



35 



40 



45 



SO 



1104 



1152 



1200 



1248 



55 
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405 410 415 

ttt aat tat att aaa ttt aga ttt aat act ttt att aaa tea tat aat 

Phe As^i Tyr He Lys Phe Arg Phe Asn Thr Phe He Lys Ser Tyr Asn 

420 425 430 

tgg aat aat aga gta ttt ggt tta gta tta tct gaa tat ate aat aat 

Trp Asn Asn Arg Val Phe Gly Leu Val Leu Ser Glu Tyr He Asn Asn 

435 440 445 

att aaa att gat aat tat gat tat tat tat tat aat aaa tat att aat 

ile Lys lie Asp Asn TyrAsp Tyr Tyr Tyr Tyr Asn Lys Tyr lie Asn 

450 455 460 

atg cat aat gca cgt aaa cct aaa gga tac att aaa taa 

Met His Asn Ala Arg Lys Pro Lys Gly Tyr lie Lys 

465 470 475 

<210> 3 
<2ll> 476 
<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 3 

Met Lys Lys Gin Asn Leu Asn Ser Ile Leu Leu Met Tyr Ile Asn Tyr. 

1 5 10 15 

lie lie Asn Tyr Phe Asn Asn lie His Lys Asn Gin Leu Lys Lys Asp 

20 25 30. 

Trp lie Met Glu Tyr Glu Tyr Met Tyr Lys Phe Leu Met Asn Ash Met 
35 40 45 
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Thr Cys Phe He Lys Trp Asp Asn Asn Lys He Leu Leu Leu Leu Asp 

50 55 60 

Met Tyr Tyr Asn Val Leu Tyr Asn Tyr His Lys Gin Arg Thr Pro Met 
65 70 75 80 

Ser Asn Lys Arg Leu Met Asn Ser Lys Asn lie Met Asp Tyr Lys Leu 

85 90 95 

Leu Tyr Thr Tyr Phe Tyr I le Leu Asn Lys Met Lys Met Glu Met Asp 

100 105 110 

Asn Tyr Asn Asn Asn Asn Asn Asn lie Ser Leu Lys Tyr Asn Glu Leu 

115 120 125 

Leu Lys Asn He Met Asn Asn Leu Asn Tyr Lys Thr Ser Asn He Glu 

130 135 140 

Thr Asn Leu Ser Asn Asn Phe Tyr Leu Met Asp Lys Tyr Leu He Asn 
145 150 155 160 

Lys Tyr Met Lys Tyr Leu Asp Met Leu Asn Met He Pro Asn Asn Tyr 

165 170 175 

Met Phe Asn Asn He Asn Tyr Lys Gly Lys Leu Asn He Lys Thr Val 

180 185 190 

Leu Asp Leu Asn Asn Asn Glu Phe Tyr Asp Tyr Leu Ser Gly Leu He 

195 200 205 

Glu Gly Asp Gly Tyr He Gly Pro Gly Gly He Thr He Thr Asn His 

210 215 220 

Ala Asn Asp Val Leu Asn Thr He Phe He Asn Lys Arg He Lys Asn 
225 230 235 240 

Ser He Leu Val Glu Lys Trp Met Asp Thr Leu Lys Asp Asn Pro Tyr 
245 250 255 
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Phe Val Asn Ala Phe Ser He Asn He Lys Thr Asn Leu Ala Lys Glu 

260 265 . 270 

Lys lie Phe Thr Asn He Tyr Asn Lys Leu Tyr Ser Asp Tyr Lys lie 

275 280 285 

Asn Gin He Asn Asn His He Pro Tyr Tyr Asn Tyr Leu Lys He Asn 

290 295 300 

Asn Lys Leu Pro He Lys Asn He Met Asp He Lys Asn Asn Tyr Trp 
305 310 315 320 

Leu Ala Gly Phe Thr Ala Ala Asp Gly Ser Phe Leu Ser Ser Met Tyr 

325 330 335 

Asn Pro Lys Asp Thr Leu Leu Phe Lys Asn Met Arg Pro Ser Tyr Val 

340 345 350 

He Ser Gin Val Glu Thr Arg Lys Glu Leu He Tyr Leu He Gin Glu 

355 360 365 

Ser Phe Asp Leu Ser He Ser Asn Val Lys Lys Val Gly Asn Arg Lys 

370 375 380 

Leu Lys Asp Phe Lys Leu Phe Thr Arg Thr Thr Asp Glu Leu Met Lys 
385 390 395 400 

Phe He Tyr Tyr Phe Asp Lys Phe Leu Pro Leu His Asp Asn Lys Gin 

405 410 415 

Phe Asn Tyr He Lys Phe Arg Phe Asn Thr Phe He Lys Ser tyr Asn 

420 425 . 430 

Trp Asn Asn Arg Val Phe Gly Leu Val Leu Ser Glu Tyr He Asn Ash 

435 440 445 

He Lys He Asp Asn Tyr Asp Tyr Tyr Tyr Tyr Asn Lys Tyr He Asn 
450 .455 460 • 
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Mel His Asn Ala Arg Lys Pro Lys Gly Tyr lie Lys 
465 470 475 

<210> 4 
<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 4 

ccggatccat gaaaaaac 

<210> 5 
<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 5 

gggtcgactt atttaatgta tec 
<210> 6 
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<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<400> 6 

aaaagac'tgg' attatagaa 

<2I0> 7 
<21l> 19 
<212> DNA 

<213>. Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 7 

tgaatatatg tataaattt 

<210> 8 
<211> 19 
<212> DNA 

<2i3> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 8 

tattaaatgg gataataat 

<210> 9 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 9 

tattagatat gtattataat g . 

<210> 10 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 10 
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tacacctatg tctaalaaa 

<210> 11 
<2ll> 20 
<212> DN'A 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :SyntheticDNA 
<400> 11 

aaaatattat ggattataaa 

<210> 12 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 12 

ttttatattt taaataaaat gaaaatggaa atggataatt ataataataa ta 

<210> 13 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence -.Synthetic DNA 
<400> 13 

aaaatattat gaataattta a 

<210> 14 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 14 

actatctaat attgaaacta atttatctaa taattt 

<210> 15 
<2ll> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Synthetic DN'A 

<400> 15 

ttatttaatg gataaatat 

<210> 16 . 
<211> 21 

<212> DNA . ' 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence:Synthetic DNA 
<400> 16 

ataaatalat gaaatattta g 

<210> 17 
<2ll> 21 
<212> DNA 

<2l3> Artificial Sequence. 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 17 

ataattatat gtttaataat a 
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<210> 18 
<2ll> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 18 

ggaggtatta caattactaa tcatgctaat gat 

<210> 19 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 19 

ttttagtaga aaaatggatg gatactttaa aagata 

<210> 20 
<211> 39 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence .-Synthetic DNA 

<400> 20 

agctaaagaa aagattttta ctaatattla taataatta 

<210> 21 

<21l> 19 

<212> DNA 

<2I3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<400> 21 

aaatattatg gatattaaa 

<210> 22 

<211> 18 

<212> DNA 

<2I3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
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<400> 22 

taattattgg Itaictgg 

<210> 23 
<2U> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :Synthetic DNA 
<400> 23 

atcatctatg tataatcct 

<210> 24 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 24 

ttaaaaatat gagacctag 
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<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 25 

gatgaattaa tgaaatttat tta 23 

<210> 26 
<2ll> 39 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 26 

attaaattta gatttaatac ttttattaaa tcatatiaat 39 



45 

. <210> 27 
.<2ll> 38 
<2i2> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 27 

tataataaat atattaatat gcataatgca cgtaaacc 

<210> 28 
<21l> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 28 

taaattttta ataaataata tgacttgttt tattaaatgg ga 

<210> 29 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
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<400> 29 

aagattaatg aattcaaaa 

<210> 30 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 30 

gattataaat tattatatac itatttttat attttaaat 

<210> 31 
<211> 38 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence .-Synthetic DNA 
<400> 31 

gaataattta aattataaaa cttctaatat tgaaacta 
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<210> 32 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 32 

ttctctatta atattaaaac taatttagct aaagaaa 

<210> 33 
<2il> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 33 

aaattattta ccagaactac tgatgaatta atgaaatt 



<210> 34 
<211> 2Q 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 34 

catataattg gaataataga 

<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rSynthet ic DNA 
<400> 35 

aatttttaat gaataatatg 

<210> 36 
<211> 19 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
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<400> 36 

tttagatatg ttaaatatg 

<210> 37 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 37 

atatgttaaa tatgattcct aataa 

<210> 38 
<2ll> 20 
<212> DNA 

<2i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence -.Synthetic DNA 
<400> 38 

ctggattatg gaatatgaat 
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Claims 

1. An endonuclease capable of recognizing the nucleotide sequence: GCCCAGACATATCCCTGAATGATACC or a 
fragment thereof. 

2. A recombinant protein of either (a) or (b): 

(a) a protein comprising the amino acid sequence represented by SEQ ID NO:3; or 

(b) a protein having an endonuclease activity for recognizing the nucleotide sequence: GCCCAGACATATC- 
CCTGAATGATACC or a fragment thereof, the protein comprising at least one deletion, substitution or addition 
of amino acid in the amino acid sequence represented by SEQ ID NO: 3. 

3. A gene encoding the recombinant protein of either (a) or (b): 

(a) a protein comprising the amino acid sequence represented by SEQ ID NO:3: or 

(b) a protein having an endonuclease activity for recognizing the nucleotide sequence: GCCCAGACATATC- 
CCTGAATGATACC or a fragment thereof, the protein comprising at least one deletion, substitution or addition 
of amino acid in the amino acid sequence represented by SEQ ID NO: 3. 

4. A gene containing DNA of either (c) or (d): 

(c) DNA comprising the nucleotide sequence represented by SEQ ID N0:2: or 

(d) DNA encoding a protein having an endonuclease activity for recognizing the nucleotide sequence: GCCCA- 
GACATATCCCTGAATGATACC or a fragment thereof, the DNA being capable of hybridizing with DNA which 
comprises tiie nucleotide sequence represented by SEQ ID NO:2 under stringent conditions. 

5. A recombinant vector comprising the gene of claim 3 or 4. 

6. A transformant comprising the recombinant vector of claim 5. 

7- A process for producing the endonudease, comprising the steps of: 
cuituring the transformant of claim 6: and 

recovering from the culture an endonuclease capable of recognizing the:nucleotide sequence: GCCCAGA- 
CATATCCCTGAATGATACC or a fragment ttiereof. 

8. An endonuclease produced by the metiiod of daim 7. 

9. Kit comprising: 

(a) the endonuclease of daim 1 or 8; and/or 

(b) the recombinant protein of claim 2, and/or 

(c) the gene of claim 3 or 4; and/or 

(d) the recombinant vector of claim 5; and/or 

(e) the transformant of claim 6. 
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FIG. 2 



o 
u 



EcoRI fragment containing 

oii2 region on Mitochondrial DNA ^ 

o 
u 
(=3 




M13IG 



Seal 



I 



Linearize through Seal treatment 



GGCCAGAC ATAT I CCCTGAATGATACC 



CGGGTCTG | TATA GGGACTTACTATGG 



Cleavage sequence of Endo. Scel 



oli2 region 



4833 bp 



Endo . Scel Treatment 



1412 bp 



3421 bp 



Plasmid PY673L 
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FIG. 3 



ECORI 



50 kDa subunif gene 
1431 base pairs 



PstI 



EcoRI 



(A) \- 



1671 base pairs 



(B) f- 



534 base pairs 



pENL7 



(a) 1671 bp fragment 




pENO.5 



(B) 534 bp fragment 




M13IG 



fA^Clonina the 1671 bp fragment (B) Cloning the 534 bp fragment 
into EcoRrsit of pUC118 into EcoRI/PstI site of pUCllS 



Plasmids pENl.7 and pENO.5 
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FIG. 4 

ATCAAAAAACAAAATTTAAATTCTATTTTATTAATGTATATTAATTATATTATTAATTATT^ 7 0 

TTCATAAAAATCAATT AAAAAAAG ACTGGATT ATGGAATATGAATATATGTATAAAT^^ TAATGAATAA 14 0 

A A A A 

TATGACTTGTTTTATTAAATGGGATAATAATAAAATTTTATTATTATTAGATATATCTTAT 210 

ACTA A A , 

TATAACTATCATAAACAACGTAC ACCTATGTCTAATAAAAGATTAATGAATTCAAAAAATATTATGGATT 280 

A A A 

ATAAATTATTAT AT ACTT ATTTTTATATTTTAAATAAAATGAAAATGGAAAT^^ 3 S O 

CT AAA 
T AATAATAATATTTCATT AAAAT ATAATGAATT ATTAAAAAATATTATGAAT AATTTAAATT AT AAAACT 420 

A .CTA 
TCTAATATTGAAACTAATTTATCTAATAATTTITATTTAATGGATAAATATTTA^ 490 

CT * A • 'A 

AATATTTAGATATCTTAAATATGATTCCTAATAATTATATCrrTTAATAATATTAATTATA-VAGGTAAACT 560 

A A A 

AAATATTAAAACAGTATTAGATTTAAATAATAATGAATTTTATGATTATTTATCAGG^ 630 

GATGGTTATATTGGTCCTGGAGGTATTACAATTACTAATCATGCTAATGATGTATO • 700 

CTA 

TTAATAAAAGAATTAAAAATAGTATTTTAGTAGAAAAATGGATGGATACTTTAAAAGATAATCCW 770 

. A A 

TGTTAATCCTTTCTCTATTAATATTAAAACTAATrrAGCTAJUVGAAAAGATTTTT^ 840 

. " CT CT 

AAATTATATAGTGATTATAAAATTAATCAAArXAATAATCATATCCCTTATTATAATTATTTAAA^^ 910 

ATAATAAATTACCTATTAAAAATATTATGGATATTAAAAATAATTATTGGTTAGCTGGTr^^ 980 

A A 
AGATGGTTCTTTTTTATCATCTATGTATAATCCTAAAGATACATTATTATTTAAAAATATGAGACCTAGT 1050 

A A 
TATGTTATTTCACAAGTTGAAACACGTAAAGAATTAATTTATTTAATTCA^ 1120 

TTTCTAATGTTAAAAAAGTTGGTAATAGAAAATTAAAAGATTTTA^ 1190 

, ;. • CT ■ 

ATTAATGAAATTTATTTATTATTTTGATAAA'rirTTT 12 6 0 

A ■ " ' • - ^ \ 

AAATTTAGATTTAATACTTTTATTAAATCATATAATTGGAATAATA<^^ - 133 0 

CTA A ' 

AATATATCAATAATATTAAi)ATTCL\TAATTATGATTATT ATTATTATAATAAAT^^ 1400 

■ ■ • ' _ . . A ■ 

TCCACGTAAACCTAAACGATACAtTAAATAA 1431. 



Nucleotide sequence of 50 kDa subunit.gene which has been modified 
to conform to universal code (nucleotides beneath the sequence 
represent those before the substitution) 
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Sequence-specific endonuclease 
activity of Endo-Scel 50 kDa subunit 

FIG. 5 A Substrate pY673L 



Ehdo.Scel 50 kDa subunit 




123 4 5 6 78 
FIG. 5 B Substrate pBR322 



Endo.Scel 50 kDa subunit 




1 2 3 4 5 6 7 
M ; ADNA Hind (H Marker 
S ; substrate DNA only 
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50 JcDa subunit from Saccharomyces uvarum 



FIG. 6A 



50 IcDa subunit (476 amino acid residues) 




217 . 346 





217 


346 


S . cerevisiae 
Endo . SceT 


Gly 


Asn 


S • uva,rum 
Endo • SuVI 


Lys 


Asp. 



FIG.6B 

3 5 GTTATATTGGTCCTAAAGGTATTACAATTA 

(GG) 

3 6 . ATTATTTAAAGATATGAGA 

(A) 
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Sequence-specific endonuclease 
activity of Endo.SuvI 50 JcDa subunit 



Endo.SuvI 50 kDa subunit 



FIG. 7A S 
Substrate pY673L 




FIG, 7 B 

Substrate pBR322 



M; ADNAHindlll Marker 
S ; substrate DNA only 



1 2 3 4 5 6 7 
Endo.Scel 50 kDa subunit 




2 3 4 5 6 7 

Endo.SuvI 50 kDa subunit 




1 2 3 4 5 6 7 



43 



